Conditional Value-at-Risk Minimization in Finite State Markov Decision Processes: Continuity and Compactness
نویسنده
چکیده
This study is concerned with the dynamic risk-analysis for finite state Markov decision processes. As a measure of risk, we consider conditional value-at-risk(CVaR) for the real value of the discounted total reward from a policy, under whose criterion risk optimal or deterministic policies are defined. The risk problem is equivalently redefined as a non-linear optimization problem on the attainable set of the distribution functions for the real values over all policies. Showing the weak-continuity of CVaR on the space of attainable distribution functions, the mathematical existence theorem of optimal policies are proved throughout the discussion of convex analysis and weak-compactness. c ©2013 World Academic Press, UK. All rights reserved.
منابع مشابه
Financial Risk Modeling with Markova Chain
Investors use different approaches to select optimal portfolio. so, Optimal investment choices according to return can be interpreted in different models. The traditional approach to allocate portfolio selection called a mean - variance explains. Another approach is Markov chain. Markov chain is a random process without memory. This means that the conditional probability distribution of the nex...
متن کاملOptimal Control of Piecewise Deterministic Markov Processes with Finite Time Horizon
In this paper we study controlled Piecewise Deterministic Markov Processes with finite time horizon and unbounded rewards. Using an embedding procedure we reduce these problems to discrete-time Markov Decision Processes. Under some continuity and compactness conditions we establish the existence of an optimal policy and show that the value function is the unique solution of the Bellman equation...
متن کاملConditional Value-at-Risk for Random Immediate Reward Variables in Markov Decision Processes
We consider risk minimization problems for Markov decision processes. From a standpoint of making the risk of random reward variable at each time as small as possible, a risk measure is introduced using conditional value-at-risk for random immediate reward variables in Markov decision processes, under whose risk measure criteria the risk-optimal policies are characterized by the optimality equa...
متن کاملA Convex Analytic Approach to Risk-Aware Markov Decision Processes
Abstract. In classical Markov decision process (MDP) theory, we search for a policy that say, minimizes the expected infinite horizon discounted cost. Expectation is of course, a risk neutral measure, which does not su ce in many applications, particularly in finance. We replace the expectation with a general risk functional, and call such models risk-aware MDP models. We consider minimization ...
متن کاملPortfolio Optimization Based on Cross Efficiencies By Linear Model of Conditional Value at Risk Minimization
Markowitz model is the first modern formulation of portfolio optimization problem. Relyingon historical return of stocks as basic information and using variance as a risk measure aretow drawbacks of this model. Since Markowitz model has been presented, many effortshave been done to remove theses drawbacks. On one hand several better risk measures havebeen introduced and proper models have been ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013